Speech Shift: Speech Input Interface Using Intentional Control of Voice Pitch
Authors
Abstract
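This paper proposes a new speech-input interface function called "speech shift" that uses voice pitch, one kind of nonverbal information. Whereas conventional speech recognition systems have mainly used only verbal information, we aim to build an easy-to-use interface that draws out the latent potential of speech by actively exploiting nonverbal information. Speech shift assigns utterances spoken at normal pitch and utterances intentionally spoken at a high pitch to different input modes, so that the user can specify the mode and enter information at the same time using voice alone. For example, in voice dictation, saying "改行" (kaigyō, "new line") at normal pitch enters those characters as text (text-input mode), whereas saying it at a high pitch inserts a line break at the end of the line (command mode). To realize this function, we also propose a method that uses the pitch of filled pauses to estimate the per-speaker pitch baseline needed to identify intentionally high utterances. We applied speech shift to a voice-enabled text editor and ran an evaluation experiment with 20 male subjects from science and engineering backgrounds, which showed that speech shift is an easy-to-use and effective input method.
Keywords: speech interface, speech shift, speech recognition, voice pitch, nonverbal information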
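The mode-switching idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the use of a simple mean over filled-pause frames as the baseline, the cent scale, and the 400-cent threshold are all assumptions chosen for illustration, and the F0 values are presumed to come from some external pitch tracker.

```python
import math
from statistics import mean

# Hypothetical command table: recognized text -> editor action.
COMMANDS = {"改行": "\n"}


def hz_to_cents(f0_hz, ref_hz=440.0):
    """Convert a frequency in Hz to cents relative to ref_hz."""
    return 1200.0 * math.log2(f0_hz / ref_hz)


def estimate_base_pitch(filled_pause_f0_hz):
    """Assumed baseline: mean pitch (in cents) of the filled-pause frames."""
    return mean(hz_to_cents(f) for f in filled_pause_f0_hz)


def classify_mode(utterance_f0_hz, base_cents, threshold_cents=400.0):
    """Label an utterance 'shifted' if its mean pitch exceeds the
    baseline by more than threshold_cents (an assumed value)."""
    voiced = [f for f in utterance_f0_hz if f > 0]  # drop unvoiced frames
    if not voiced:
        return "normal"
    relative = mean(hz_to_cents(f) for f in voiced) - base_cents
    return "shifted" if relative > threshold_cents else "normal"


def handle_utterance(text, mode):
    """Return the string to insert: a command's effect in shifted mode,
    otherwise the recognized text itself."""
    if mode == "shifted" and text in COMMANDS:
        return COMMANDS[text]
    return text


base = estimate_base_pitch([110.0, 112.0, 108.0])
normal_mode = classify_mode([118.0, 0.0, 122.0], base)
shifted_mode = classify_mode([178.0, 182.0, 0.0], base)
```

With these sample values, an utterance near 120 Hz stays in text-input mode and one near 180 Hz is treated as a command, so `handle_utterance("改行", shifted_mode)` yields a line break while `handle_utterance("改行", normal_mode)` yields the characters themselves.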
Similar resources
Speech shift: direct speech-input-mode switching through intentional control of voice pitch
This paper describes a speech-input interface function, called speech shift, that enables a user to specify a speech-input mode by simply changing (shifting) voice pitch. While current speech-input interfaces have used only verbal information, we aimed at building a more user-friendly speech interface by making use of nonverbal information, the voice pitch. By intentionally controlling the pitc...
Voice as Sound: Using Non-verbal Voice Input for
We describe the use of non-verbal features in voice for direct control of interactive applications. Traditional speech recognition interfaces are based on an indirect, conversational model: first the user gives a direction, and then the system performs a certain operation. Our goal is to achieve more direct, immediate interaction, like using a button or joystick, by using lower-level features of voi...
A wavelet- and neural network-based voice interface system for wheelchair control
Voice control has long been considered as a natural mechanism to assist powered wheelchair users. However, one implementation difficulty is that a voice input system may fail to recognise a user’s voice. Indeed, speech activated interface between human and autonomous/semi-autonomous systems requires accurate detection and recognition. In this area pitch and end-point detection is of vital impor...
Investigating the effect of speech sample duration on habitual pitch in normal women aged 18 to 30
Introduction: Habitual pitch is the perceptual correlate of the mean fundamental frequency of speech. Clinical evaluation addresses whether a person's habitual pitch falls within the normal range. Because abnormal pitch is a common feature of many voice disorders, assessing habitual pitch and the factors affecting it may help scientists to determine the exist...
Speech Spotter: On-demand Speech Recognition in Human-Human Conversation on the Telephone or in Face-to-Face Situations / Masataka Goto
This paper describes a novel speech-interface function, called “speech spotter”, which enables a user to enter voice commands into a speech recognizer in the midst of natural human-human conversation. In the past, it has been difficult to use automatic speech recognition in human-human conversation since it was not easy to judge, from only microphone input, whether a user was speaking to anothe...